Performance Bug Analysis and Detection for Distributed Storage and Computing Systems
نویسندگان
چکیده
This article systematically studies 99 distributed performance bugs from five widely deployed storage and computing systems (Cassandra, HBase, HDFS, Hadoop MapReduce ZooKeeper). We present the TaxPerf database, which collectively organizes analysis results as over 400 classification labels 2,500 lines of bug re-description. is classified into six categories (and 18 subcategories) by their root causes; resource, blocking, synchronization, optimization, configuration, logic. can be used a benchmark for debug tool designs. Although it impractical to automatically detect all in TaxPerf, we find that an important category blocking effectively solved tools. analyze cascading nature design automatic detection called PCatch , (i) performs program identify code regions whose execution time potentially increase dramatically with workload size; (ii) adapts traditional happens-before model reason about software resource contention dependency relationship; (iii) uses dynamic tracking whether slowdown propagation contained one job. Evaluation shows accurately representative observing system executions under small-scale workloads.
منابع مشابه
Hybrid Regenerating Codes for Distributed Storage Systems
Distributed storage systems are mainly justified due to their ability to store data reliably over some unreliable nodes such that the system can have long term durability. Recently, regenerating codes are proposed to make a balance between the repair bandwidth and the storage capacity per node. This is achieved through using the notion of network coding approach. In this paper, a new variation ...
متن کاملPerformance Evaluation of Distributed Storage Systems for Cloud Computing
The possibility to migrate a virtual server from one physical computer in a cloud to another physical computer in the same cloud is important in order to obtain a balanced load. In order to facilitate live migration of virtual servers, one needs to provide large shared storage systems that are accessible for all the physical servers that are used in the cloud. Distributed storage systems offer ...
متن کاملinvestigation of single-user and multi-user detection methods in mc-cdma systems and comparison of their performances
در این پایان نامه به بررسی روش های آشکارسازی در سیستم های mc-cdma می پردازیم. با توجه به ماهیت آشکارسازی در این سیستم ها، تکنیک های آشکارسازی را می توان به دو دسته ی اصلی تقسیم نمود: آشکارسازی سیگنال ارسالی یک کاربر مطلوب بدون در نظر گرفتن اطلاعاتی در مورد سایر کاربران تداخل کننده که از آن ها به عنوان آشکارساز های تک کاربره یاد می شود و همچنین آشکارسازی سیگنال ارسالی همه ی کاربران فعال موجود در...
RobuSTore: Robust Performance for Distributed Storage Systems
Emerging large-scale scientific applications involve massive, distributed, shared data collections (petabytes), and require robust, high performance for read-dominated workloads. Achieving robust performance (low variability) in storage systems is difficult. We propose RobuSTore, a novel storage technique, which combines erasure codes and speculative access to reduce performance variability and...
متن کاملcoordination of protective schemes in distribution and subtransmission systems including distributed generation
نصب منابع پراکنده در سیستم های توزیع و فوق توزیع علاوه بر مزیت های فراوان، مشکلاتی را هم به همراه دارد. یکی از این مشکلات تأثیر این منابع روی اندازه جریان های اتصال کوتاه است. به طور معمول از حفاظت های مبتنی بر اصل جریان زیاد در سیستم های توزیع و فوق توزیع استفاده می شود. از مهمترین طرح های حفاظتی استفاده شده در فیدرهای توزیع، حفاظت فیوز- ریکلوزر است. همچنین از طرح های حفاظتی جریان زیاد جهتی در...
ذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: ACM Transactions on Storage
سال: 2023
ISSN: ['1553-3077', '1553-3093']
DOI: https://doi.org/10.1145/3580281